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Abstract 

We present an analysis of the performance of an elitist Evolutionary 
algorithm using a recombination operator known as 1-Bit-Swap on the 
Royal Roads test function based on a population. We derive complete, 
approximate and asymptotic convergence rates for the algorithm. The 
complete model shows the benefit of the size of the population and re- 
combination pool. 

1 Introduction 

Evolutionary Algorithms (EA) are a set of heuristic optimization tools, that are 
well-suited to problems with poorly-understood landscapes (sometimes known 
as black-box optimization). Despite a rich history in application, theoretical 
analysis has been lagging behind; while there have been some advances in recent 
years, the analysis has mainly been restricted to single-parent algorithms, which 
is an extremely limiting assumption. 

The algorithm that we analyse is described as a (/z + A) evolutionary algo- 
rithm, where fj, and A are the size of the population from which solutions are 
picked, and the size of the mating pool that is created from them, respectively. 
The + signifies that the algorithm is elitist, i.e., that the best solution at each 
iteration is always reproduced in the next, so that the best fitness in the popu- 
lation can never decrease. In this paper we consider the effect that population 
has on the expected time when the algorithm finds the optimal solution to the 
problem. We model the distribution of elite species in the population using a 
static model of a uniform distribution. Most work to date has considered only 
(1 + 1) EAs. The operator that we use to recombine solutions is the 1-bit-swap 
(IBS) operator that was described in [TSMH10] . 
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We derive an exact expression for the expected runtime of {p, + A)EAiss- 
Since this expression does not appear to have a closed form, we then develop 
an approximation to it and compute the asymptotic limit. Rather surprisingly, 
in the asymptotic expression the size of the population and recombination pool 
cancel out, and so they do not appear. 

1.1 Royal Roads Function 

The Royal Roads (RR) is a test function introduced in |MFH92| and analyzed 
in |Mit96] , where a population-based Evolutionary Algorithm was found to have 
underperformed a simpler heuristic Randomized Local Search (RLS), which con- 
tradicted the theoretical findings in the same article. The Royal Road was ini- 
tially developed to demonstrate the schemata theory of EAs. A string of length 
n is split into K consecutive bins or blocks (which we index as ki, k%, ■ ■ ■ , Kk)- 
All bins have the same size, M, so that n = KM. The fitness of the string is the 
sum of the fitness for each bin, and the fitness of each bin is M if all the bits in 
the bin have value 1, and otherwise. Thus, the possible fitness of strings with 
the Royal Road function are 0, M, 2M, . . . , KM, so the function has plateaus of 
fitness, where a large number of strings have the same fitness value. This means 
that single bit modifications to the solution string will generally not improve 
the fitness function. This means that within the plateau, the solutions created 
by the EA are expected to show a random walk behaviour. 

1.2 Past Work 

RR has not received much attention in recent EA literature, where the focus 
has been on the rather simpler OneMax (also known as Counting Ones) fitness 
function. However, in [SW03j an upper bound on the expected running time 
of 0(n 6 ) was found for a version of RR. In |Mit96j the bounds on convergence 
for RR were found to be 0(2 K logn), where n is the length of the string and K 
the length of the bin, up to a linear term tighter than the bound for the RLS 
(0(2 K NlogN)), although numerically RLS outperformed EA. This result does 
not involve the size of the population or recombination pool in any way. 

For the OneMax test function there has been rather more research on EAs, 
although (1 + 1), (fx + 1) and (1 + A) set-ups are still more widespread. In 
|HY02j it was shown that the effect of population is problem-specific, i.e., in- 
crease in population size may not improve performance at all. Very recently, 
in [CTCYli] . it was shown that populations of size O(logn) boost performance, 
while those of size ^(7^77) impair the progress of the algorithm (with the anal- 
ysis based on the TrapZeros multimodal function) and reduce the probability of 
global convergence. 
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1 


Initialize population size /x 




repeat for t generations: 


2 


select -| pairs of parents from the population using Tournament selection 




repeat -| times: 


3a 


select a bit at random in Parent 1 


3b 


select a bit at random in Parent 2 


3c 


swap values in the selected bits 


4 


after the recombination, keep a best species in the population, 




replace the rest with the best species from the pool 




Table 1: {fi + X)EA 1B s 
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Analyzed Algorithm: (fi + \)~EAibs 



The genetic operator that we consider in this paper is not the usual mutation 
operator. The k-Bit-Swap genetic operator (KBS) was introduced in TSMH10 . 
It contains some features of both mutation and uniform crossover and recom- 
bines information between two parents in a random manner. In this article we 
use 1-Bit-Swap (IBS), which picks exactly 1 bit from each parent uniformly at 
random. 

See Table [T] for the pseudocode of the algorithm. 

Tournament selection consists of picking two species at random and putting 
the fitter of the pair into the mating pool. 



3 Model Setup and Assumptions 

The main quantity we analyze in this article is the first hitting time of the global 
solution of the test problem: 

T % R = min{t > : f(a) = n} 

where A is the set of all possible populations that include a global solution. We 
want to find ~Et rr , the expectation of this time parameter, for the (/i + A) EA 
with IBS as the only genetic operator. 

3.1 Improvement process 

We start with the pessimistic assumption that each bin k starts with an equal 
number of Os and Is, which implies that the starting fitness of all elements of 
the population is 0. As the Royal Road fitness function makes incremental im- 
provements impossible to see, in order to measure the progress of the algorithm 
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wc introduce, in addition to the fitness function, an auxiliary function, in this 
case OneMax (where the fitness of a string is simply a count of the number of 
Is in it; for further reference see e.g. [CHS+09j l. We denote the value of the 
auxiliary function for a bin k as V K , which can theoretically have values between 
and M, although in practice they all start at M/2 because of our assumption 
above. Since only 1 bit is changed in a string at each iteration, only one bin can 
evolve at a time. In a slight abuse of notation, we refer to that bin as the 'active 
bin' and index it as k. Within a bin, the number of improvements that have 
already been made (i.e., the number of bits that were and have already been 
changed into Is) is denoted by the variable which starts at and increases to 
M/2. 

We restrict our attention to elite pairs in the recombination pool, i.e. pairs in 
which both parents are currently elite species. This is a limitation of our analysis 
that means that we underestimate the chance of success. The probability of 
selecting an elite pair in the recombination pool is: 

a 2 (a + 2(fi-a)) 2 (a(2 M - a)) 2 
"sol (a) = — 



M 4 

where a is the number of elite species in the population. 

Having selected the pair, the probability that as a results of swapping bits 
between them, a better species evolves is: 

_ 2(f -Q(§ + _ (M -2l){n + nM + 21) 

SWaP ~ n 2 - 2^2 

This probability comes from the fact that we want to select any in bin k in one 
of the parents and a 1 anywhere in the other parent. Obviously, as the number 
of Is in both parents grows, this probability grows too. In Section I4TT1 we also 
use the probability of failure: 

Pf = 1 -^swap 

3.2 Population and elitism assumptions 

We assume that each generation currently elite species in the population are 
distributed uniformly: 

a ~ Uniform^— ^ 

This is a static model, i.e. this distribution does not change throughout the 
run of the algorithm; we will consider a dynamic model for this distribution in 
future work. We also assume that the rate of elitism (the number of species 
saved for the next generation) is high enough, that is, high enough to keep all 
elite species. 
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4 Derivation of the expectation of convergence 
time 

We present three main results: exact, approximate and asymptotic. The latter 
two are necessary, since the complete one does not have a closed form. 

4.1 Exact expression 

We start with introducing the probability of failure to improve V K : 

P(G ) = P(Go\H s ) J2 P(Hj\a)P(a) (1) 

j=0 a=l 

where Hj is jth elite pair in the recombination pool A, and a is the number of 
elite species in the population \i. 

The probability to fail to improve a bit in a bin given I improvements so far 

is: 



P< Gm ) = iy a^-(M-2l)(n + KM + 2l) y 

^ ^ (a(a + 2^-a))) 2 y ^ _ (a(a + 2 M ( M - a))) 2 ^ ■ 

a=l ^ ^ 

= -ilnfi) £(P S ei(a)) J (l-P se i(a))^-^ 



= 7T E £ (T) (^Psei(a)F(l - P se M))^ 

= -E( 1 - P - 1 (a)P S wap)^ (2) 



a=l 



The last step is due to the Binomial expansion: J22=o (fc) ^™ k = (a + b) n . 
Therefore, the probability of an increase in the auxiliary function is: 



P(Gi) = 1 - P(Goi) = 1 - - ^(1 - P se i(a)P, ~ 



swap ; 

The expected time until the next improvement of the auxiliary function of 
a bin k is: 

M-l 
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and, finally, summing over all n from f to K we obtain (since G depends on 
both I and n): 



k f-i 



K=l 1 = 

K f-1 



K f-1 

(a((2/^ — a))) 2 (M — 2i)(n+KAf +2Z) \ A 
«=i ;=o 1 p 2^a=l\ L 2jF^ ) 



(4) 



The benefit of larger population sizes is clear in the - term in front of the sums 
in the denominator. Also, increases in the size of X lead to reductions in the 
probability of failure. 

We test this expression numerically for different values of n, (i, A (see Ap- 
pendix C). Unfortunately, this expression does not seem to exist in closed form, 
so we instead go ahead with finding an approximation to it in the next subsec- 
tion. 



4.2 Approximate and asymptotic expressions 

P{Gol ) = iy( 1 - + 2 ^ z Q ))) 2 ( M - 2l )( n + nM + 2l ) \ # 

it ^— : ; V 2u 4 n 2 / 

1 A/ (a(2fj. - a)f{M - 2l)(n + kM + 2/) \ 4 
~^^\ 2^ 4 n 2 J 

1 A A( Q (2 A ,- Q )) 2 (M-2i)(„ + ^M + 2i) 1 /"M -( serf ) 

« - V e ^ e \ ^ ' da (5) 

The last step in the summand was due to lim„^ 00 (l — i) = |. Note that 
7 = \( M _2i)(n+K.M+2i) ' anc ^' assumm g that M — ^! the upper bound on 7 is 
\{M-2t)(n+kM+2i) < ^3AS~ = O(^n). although for monotonically decreasing 
functions, such as this one, by the integral test the sum is larger than the 
corresponding integral; for \i « n the sum is closely approximated by the 
integral. 

We denote I\ = f(a)da — e V ^ ' da. Expanding the function 
inside the integral as a Taylor series around ao = 1 up to the second term, we 
get (since f(a Q ) = - 4(2 ^- 1) ): 



J M ,e-'^_ 4(2,-1)^-1X0-1) (6) 



G 



Therefore, the integral turns into: 



h 



= e ( ~> (p - 1) 



_ 4(2 M -l)( M -l)(q-l) \ a 



e ^ 7 



1 - 



2(2^ -l)(p-lf 



7 



(7) 



The probability of failure is approximately (with the assumptions specified 
above): 

P(G ol ) » 



Accordingly, the probability of a successful swap is: 



P(Gt) « 1 



2(2 M -l)( A1 -l) 2 



/i 



Using the sum of expectations of Geometric random variables with different 
parameters, the expected time until filling a bin, i.e. improvement of the fitness 
function, is: 



ET K = 



7 - 7 e (2 ^r )2 + 2(2^- 1)0- l) 2 e { ^7T )2 



(8) 



We make two approximations here. First, we use a Riemanian sums approxi- 
mation to obtain [0, 1] bounds on the integral, and then expand the integrand in 
Taylor series with 2 terms around the midpoint to obtain a good approximation 
of the integral. The Riemanian sums approximation is defined by: 



n -i 

lim \^f(xj)=n / f(nx)dx + o(n) 

.7=1 J ° 



and 7 is transformed accordingly: 



7 = 



4/j 4 n 2 



Then: 



(M _ 2 (f - l)Z)(n + kM + (f - 1)0 



o 7 - 7 e 1 ^ + 2(2^ - l)(/j - l) 2 e 1 J 



(9) 



Therefore, the expected first hitting time until the evolution of the bin k is 
(the rather long Taylor series expansion of the integrand is given in the Appendix 
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A): 



o 7 - 7 e K ^ ' + 2(2{i - - l) 2 e 1 ^ 
4^n 2 (f - 1) 



A(f + l)(f +n + KM-l) 



2(2^-l)Q-l) 2 4 M 4 n 2 4^ 4 n 

(10) 



where: 



■if' 1 



crj = e 

M , r , 

o"2 = — + n + kAJ — 1 
Note that <7i has the interesting property (given n — X) that: 

lim e 4 f 4 " 2 = lim e i^™ 2 = lim e M " V M " / = 1 

n— >oo n— >oo n— >oo 

which means, that for sufficiently large values of n and the second and the 
third terms in the square brackets cancel each other out, and the first term is 
just 2(2^i - l)fju - l) 2 . 

Finally, summing over all k, the number of bins in the string, we get the 
approximation of the convergence time of the [n + A) algorithm on RR test 
function: 



Et, 



RR 



2^n 2 (M-2) %^ 1 



( M +A)£A 1BS ~ A(M + 2)(2m _ _ 1)2 g m + n _ 1 + kM 

2^n 2 (M-2) r / f + n - 1 + M + KM\ /f + n-l- 



AM(M + 2)(2/z- 1) 

2^ 4 n 2 (M-2) / 2ifM 



r , / ^ + n - 1 + M + A M \ , / f + n - i \ - 

ifM- m ) -^rnsH. 



" XM(M + 2)(2^-l)(/i-l) 2l ° g l 1+ M + 2rJ ^ 

where ipo is a Digamma function (see e.g. [XS65 ( GKP95 ). In the derivation of 
the asymptotic expression for this bound, all population-related terms cancel 
out (since fj, = A and both numerator and denominator have the highest term 
fi 4 ), and the order of convergence is 

2l„„ (l i KM 



vr^ x)EAlBS = o( nlos[1 M M+n ' i 



which seems to be a comparable result compared to those available in literature 
covering fitness functions with plateaus of fitness (e.g. |HY02(ICHS+09llSW03] ) . 
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5 Conclusions and Future Work 



We have derived three expressions for convergence of an elitist (fi+ X)EAibs on 
Royal Roads test function: exact, approximate and asymptotic. Although the 
exact expression for the expected convergence time clearly shows the benefit of 
increase in the population (at least when the population is relatively small), the 
approximate result has an equal order of population and asymptotic has none 
at all due to cancellation. 

An important assumption for the approximation of Et^u e4ibs was that 
fi « n, but we never specified the relation, unlike in |CTCYlI] . This is 
something to look at in the future. Since the effect of the population is known 
to be problem-specific, we will be able to get good insights into it for unimodal 
functions with plateaus, such as Royal Roads. 

We have performed our analysis assuming Uniform distribution of elite species 
in the population, something noone seems to have done in EA literature before. 
This is a static approach to convergence (i.e. the distribution assumption does 
not change throughout the run of the algorithm). We would like to look at the 
dynamics of the elite species and their effect on the probability of success, P(Gi) 
and expected convergence time. 
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A Taylor series approximation of the integrand 

We give the expression for the Equation ITUl here due to its length. It's Taylor 
series expansion of the integrand around midpoint of the interval (0.5) 



(f "2"i fl 4„2„ 2( M +1) W(M-2) \ (16M-32U 5 , 

M ^ -l^WfW))- 3 * +yi 4(M-2)<r 5 

where 

2(/i-l)(/i-l) 2 A^ 2 n 2 4/i 4 n 2 
o~i = — 



^2 A(f + l)a 3 A<T 2 a 3 (f + l) 

X(2^-1)(^+1) J3 M . M . 2 

o- 2 = e , 0-3 = —+n+kM-l, 04 = I — — h 1 ) , ct 5 = n + fcM-2 



Sl = 16(M - 2), S2 = 4A(2 M - l) 2 ((Af - 2)( — + 1) - a 3 (M - 2)), s 3 = L 

2(2^ - l) 3 (/i - 1) 2 (2M + 2n + 2feM - nM - kM 2 - 4) 

<Pi = 

S3CT2 



B Additional Derivation Details 

To derive expressions in Section 14.11 we extensively used properties of indepen- 
dent Geometric RVs that are not identically distributed, which is also known 
as Coupon collector's problem (see e.g. [GKP95 ): if Xi ~ Geom(pi) it ex- 
pectation is E[Xi] = jr. Therefore, if Y = Yn=i x U = £" =1 E[J*Q] = 
± + ± + + 

Pl P2 Pn 
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n 


K 


M 


fx 


A 












4 


4 


145 


315.31 


32 






10 


10 


72.4 


268.22 


4 


8 


20 


20 


44.2 


192.29 








30 


30 


34.5 


173.56 








4 


4 


570.62 


612.46 


64 






10 


10 


279.88 


497.93 


8 


8 


20 


20 


153.46 


454.47 








30 


30 


112.30 


372.04 








4 


4 


2264.36 


1365 


128 


16 




10 


10 


1048 


1239 


8 


20 


20 


570.44 


1091.5 








30 


30 


401.99 


949.4 



Table 2: Theoretical and computational bounds for (fi + X)EAfg S 

For Equation [1] we use the Law of total probability twice: first, conditioning 
on Hj, then on a: 

m rn n 

P(A) = P{A\B l )P{B l ) = P{A\Bi) PiB^PiCi) 

i—1 2—1 j — 1 

C Numerical results to verify Equation [4] 

Column f(^ + \j E A 1BS was obtained by running the algorithm with different pa- 
rameters 20 times, each run was 2000 generation each. The earliest achievement 
of the global minimum for each run was saved and then averaged over. 
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